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AISTBACT 

A specific application of tha procesa o£ autoaatmc 
azans for any Introduatory statistics course Is dasc^lbed. The v 
process of aotoaatlng axafs v^s accoapllshed by aslngtha Statist: 
lest Iten collection Systein (S7ICS) « This system ira^ first used t( 
salact a sat of ^uastlons based on course raqulreffients estii^llshaf 
advance; aftertrard, SXICS vas utlllidd to generate aultlpla mxkmi 
froiB this collactlon of guest Ions, The procedi^re for autbfflatlng 
Involvad five stepsi (1) ^establishing course objectives for the 
aaterlal to he covered; (2) ' selecting a guastlon frame# or 
collection^ tovreflect the established objectives; (J) editing 
extensive ravleM of the. guastlons consldarad for use; C^) refln^ 
the guastlon frame and possible additions to Iti and (5) generi 
ejcans using the convarpatlonal program BlQEZ^ Thasa five steps /ware 
covered In details The exam production propess was Illustrated/ by 
applylflg STICJ In the context of an 'Imaginary course that purp/orted 
to promote student adoption' of the critical^ guestlonlng QUtl^ok 
expressed In fiuff*s book Hon to Lie with statistics. Fifteen/ 
objectives defining tihat Is expected of students cQmpletlng the 
cbursa, and exam guestlons and ansuer^^ are provided in appindlcesi 
XAutbor/iLi^ ^ ' ' ^ // 
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^, ^^^^-.-jes a specific afiplieation of the process 6f automating ^ 
exams lor a coUrse. Thfe- process of, autpmating^ ex^ij^^was accomplished by using 
the S^atistiqai Test Item CoJllection System;' STICS (J. Warren, at al. , 1975) . 
This system was first used select a set' of questions based on course rer 
qulremants established in^ advance; afterward , STICS was utillEed to genernte^ 
.multiple eKams from tihis collection of questions, ■ ^ 

Since our purpose is to ""show' how STICS can be used to prepare exams for 
any introductory statistics course^j we chose to illustrate' the exnm production 
process ^y making up a new course tather than use one of the courses involved 
in the development of STICS* The, book How to Lie wltli. Statistics (D. Huffj 
Norton & Co*, N,Y. , 1954) was chosen as a starting point for. our now coursij* 
The application of STICS is^in the context of an imaginary course that pur- 
ports to promote student adaption of the .cfitical, questioning' outlook expressed 
in How to Lie with S tatis tics. * ^ " . 



'eric 



THE PROCEQURE 



The procedure for automating exams. involves five stepsf* (1) establishing 
course objectives for the material to be covered ; (2) the selection of a ques-^ 
tion frame, or collection, to , reflect the established objectives; (3) editing 
and extensive review of the questions tonsidered for use; *^(4) further refine- 



ments to question frame "!and possible add^tipns to it^and (5)' generating 
exams* These five steps will bi covered in detall- 

Step, 1 was^^establish our objectives for this new course* Guldielines 
In Preparing Inst rue tional Qbjedtlves (R,F. Mager , Fearon* Belmont Calif * , 
1962) were followed to' determine. our course objectives excepe that we did not 
attempt to follow Mager's approach for the establishment of specific minimum 
requ»ements/ We found a thdrough^ ^e^examination of the^ matterial in How to 
Lie with Statistics necessary in order to identify explicitly what, behavior ^ 
or abilities, would be required of students when they finish a course on this 
material* For .example , Huf f devoted an entire chapter to the three common 

. ' • - - " .^r^ - ' - ' : ' ^ ' ^ . , ■ ^ ■ ■ . . > 

"averages" I mean s ^mediani^ and mode* These three measures were clearly dis^ 

tinguished with examples of 'which . types of situatidns would best have the 

■- > * ' . - " .* 

"average" defined as a mean, median^ or mode* Applications as-well as compu^ 

tatioiial eKamples. were providedp We agread on the following three spociliq . . 

behavioral objectives to cover this topic: 

( i) Given a value said^ 5o be^ an average, a^tudent should recogniEe 
i^iw ■ . ■ -■ . . 

the need "to know whether the average is a siean, medi^an, or mode, 

* ■ 

.. (2) A student shpuld be able to identify which average is most ap^ / 
plicable, or informative 5 ^ for a given siq^ation, 
(3) Given a data set, the student should be able 'to determine the 
mean, median, and mods'* - 

A total of 15 objectives were decided upon to initially define what* is ex- 
pected of students completing this course (see Appendix JL) , While we recognized 
that this set of objectives sliould be regarded as a trial set,, we also hopad 

• ^ . . . . - ^ - ^ . 



that it would not be necessary to return to this activity, * * 
^1 This process of establishing course objectives for How to Lie with Stat= ' 

ist:|Lcs proved time . consuming. Identifying and stating succintly what. we= ex- 

L • ' • ^ '-'^ ' ■ 

peeped of student behavior after completing the material took approximately 

itKr^e days* (This time inves tnent. was made up of one to three hour sessions 

^: : % ■/ . ■ ' > 

at the ends df severalsdays rather than three conseciftive days.) 

" ■ . " ... . ■ ■ . - /■ 

Step 2 involved the selection-of\^^a ques tion f rame^(collectlon) based on 

the objectives . established in Step* 1, T^s required (a) clioosing keyterms 

likely to^ be associated with questionif related to our choice of objectives; 

and (b) running the STICS program SELECT to extract these questions from a 

large collection of questions and answers , the National File* . , 

[ Each item in the National File haS^ associated with it a series of key- 

terms and question characte^stics (Technical Report)* Generally ^ keyterms 

indicate statistical topic coverage wh^le question characteristics, describe 

such things as estimated difficulty^ Question ^-type (essay , multiple ^ choice, 

... )Cetc. Our plan was to initially select on the basis of kayterms only in 

hopes of forming a large pool ^of , questions that could later be refined using 

question characteristics. ; . 

We began by Searching the list of keyterms for the National File (see 

Appendij^ 2) to find terms, likely to be associated wit|i our own set of object-- 

- # ■ 

ives. For exampliej -i*. ^ 



• _EKampjLe 1 . / ' j 

Objective 6. Given a data set, a student should be able to determine 

the means median ^ and mode. . \i 

keyterms chosen: ME AN, MED I AN, MODE » = 



Objective 7. A student should be ^^ble to identify J|raphs , or charts : 
>^hich have beere incorrectly .labeled ^r, havq truncated 

■ ' ■ • • - , ^' - ■ 

Keyterms chosen r TYPICApIC/GRAPH. COMIONPITF ALLS, HI ' 



V . . . . . - 



toample 3 



ective 3» A student should be. able to defiiiie pnd eJIplaiti A ^ 
" advantage^ of I random sampiiQg and; s tratif ied ^Bnmpling, 



Keyterms choseni Slt^LEMNpOM.a^RATIFIED^ " :^ ^ 



We spant^ ar6und two hours chobsing key terms i! ' ^ ^ 'r ^ ' ^ 

• " 0 ■ ■■ >' r-'T^^ 

Next, we used phe conv^sational program SELEuT several^, times tS obtain^a 

' . * ^ ^ . _ . _ ' ^ ' ' _ V* ^ ■ % ' ^ 

collection of questions linked to our chdices of keytjerms, SELEGf ^s used- 

■ / { ^ . : i . ' ' '.^ ' ' . - ' ' . ' - 

more ^^^a^^nce partly because "of a syste^ cMgh and partly because, we committed 

, \ ^ / . " ^ ..." h ^ 

a classical blunder* (SELECT begins by asking if the user is an eKpert. We 

modestly said yes* And missed^ a * message ^ ^.quf a of change.s 

S "%4_/ / 1 ^ ^ ■ » ^ ' 5 

occurred since our last^use -of 'the program*) Following Ch^ie difficulties ^ * 
which Stretched over a two day periWdj we did make a| 45 minute. run .to extract" 
an initial question pool of 194 quistions^ FRAME.(1) J We expected " that many of 

, but hoped ithpt it would provide mfs.t 
of the items needed for pWeparicm exams 'for otir nev 

: V " ' I ' 

SELECT appears in Appendix 3. / 




colirse, A sample run of 



# 

Step 3 involved the exanmiation and subsequent editing of FRAME(l). The 
full set of questions ^ere printed so we could review^ each item . We decided 
that 53 questions (27%) were acceptable including those that needed" minor moji-- 



ficatiori* Since TO wanted a larger ^ite^l pool% we reconiiderfed oyi!?*^^ 
of SELECT .and, madf some changes. The usual search. basis u§ed by SELECT Is to 
" retrieve items on the* basis of what are called primary key terms # These, are 

terms chat. tWe iquestibn calpsifier regards as close's t^ td the central topicts) 

- ' ■ V- - . >V ■ . r ^ / . ^: ^ ;/■/..■• - : / 

of .a qneB'tionp| In addition/^ .aacondary keyterms also ara^-assigned to items, 
. For eKampies a^keyterm such as ^CO^MNPITFALLS isn' t likely to be /attached Co 

-' ' . ' ' ' ' ■-■ '. ' \ : " •. » , - r = ' - : 9 * 

a question ae a primary f^ke^term, but it often will ^appear as a secondary key= 
terml ^We .modified our use of SELE8T so that it would search for both primary 

. . ' ;/t^' ' J , _ ^ - \ ' ■ ' ' s ^' ; ' . ■ ^ . ' ' ^ ■ 

and secondary kaytertns and 'made use of addtftlcwal keyterms that %e had negllcted 
: , \ " ■ ; ■ t ^ * ^* - / ' .-\ ' 

previously* Our new run of SELECT yielded 43 new ltem&. We fo'und only eight 

*6f these (19%) acceptable. ' The total. acceptabre now-was 61, ' 

■ • " " . ' ; ^ - ^' ' ^ • ^ 

At thia point we decided to , reconsider our initial/set of objectives'. We 
concludedf that we had allowpd How to |Li^ with Statistics to dom^^-tfe our choice 
of objectives t^*^tfe point that we had neglected to Include any objectives that 
coulM nqc be supported by that book alone, We had objectives that faithfully 
reflected How to fie with Statistics ^ but not our qwn judgements of what itu- 

- \' - ■ ^ * . " ' . : - M - ■ t : • . ■ ■ 

dents should gain from our course, ^We then revised our set of objectives , - 
-Vlargely by introducing objectives that include computational activities, and^ : 
the notion' ofVconf^dence intervals . (oiy ectivei 13 through 17 in Appendix 4) , 

* 'when How to Lie with Statistics wa&'puBlTshed (1954) it was probably very, 
reallstio to do no more than mention that a standard errpr is a useful piece 
of information! But, now it seems ° unwise not \o have students learn that they 
.themselve,s can obtain thip usaful piece of inform^ion (lising. a pocket calcu^ 
lator or computer) and to include doing so among the qbjoc tives oC'thls kind 
of course. Similar arguements led to the addi|^on of ^new objectives such as 
those dealing with confidence limits for a mean, etc, 

i . ■ : ■ ^ ^ ■ "' ■ 



i) 



Again %i?a re turqed to^^^^^GT and a^in we extracted and i^efined a new pool 
of questions. This reconsideration^ of djbj actives and re-e^traction of items 
•took three days to provide us wtth a collection of ^78 items, / ' _ 

&tep .4 involved extensive editing ofi our .qyj^stion' frame , to prepare for 

exam generation* In doing -^this we set^ up lour question collection so that exams 

could be prepared by using random snmpling^ within two question groups — one 

^ponsiscing of questio^ requiring less than five mli|utes, the other consisting 

of questions rated five jninutes or more* Within each of these groups we chose 

to ha'^e the frequency of questions Healing with' an 'objective dependent on the 
.- _ - ^ 

relative importance of the objecttye^ Question frequencies were i,n a 3i2:l 

■ .. • " ' ■ ■ . ' ^ 

racA with the highest frequency applying to the most important Qbjectiva^, 
Table 1 summariEed the distribution of ^questions that we diecided *upqji.. 

In arriving at this^ distribution of .questions we found it necessary to 
discard 31 questions and to write ^17 new quest Ions, If -we had not made this 
adjustment we would not ha;ve been in a position to employ the simple random - 
sampling technique that l^ter was used in forming exams. We would have had 
a great excess, of questions invqlving computational skills and' few, if iny , 
questions concerned with several other objectives. This process of adjust^ 
ment required approximately four days of effprt* Most of this time was con^ ^ 
sumed in writing new items and adding them to the^question frame. 'Although 
we had hoped for even morij on# of our rewards for having used the Ni'^tional 

' ■ ^ - ■ ' ( ' ' ■ ' " ^ 

File was that 47 items did not have to be Written, _ , . 

" . ,fc ^ - 

Step 5 was ac-tualj^y generating a series .of eKams* using 'the conversational 
program BEGET. We spend mrout|d ah hour using a terminal to obtain the three' 
exams listed 4n Appendix 5 (answer guidelines were also obtained )*.__Since wo ' 
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T^LE L — ^ DiscributiQn of questions in the irame sampled' for preparing exams 



OBJECTIVE 



RELATIVE 
iteORTANCE 



1. Identify how sample information was obtained 



2* Identify b^iae in a sample . : 

3. Define ^nd eKplain rand^om and stratified 
samples 

■ - . ' * • . \ . ■ 

4. Distinguish, among the three common averages 

5. Identify which average is most applicable 

J* Calculate a mean, median, and mode ^ 

Jdentify misleading or poorly construct^ 
■ ^ ' - graphs 

8^ The Effect of sample size on results 

9. Define and explain .level of significance 

10*, Define and explain the standard error of 
^ mean ■ ' . 

11* Recognize use of standard ^nd/or probable 
error 

12,^ Recognize that small differences may be 
meaningless - 

'^IS, Use of a random numbjers table 

14, Recognize the interrelationship of signifi'^ 
^ cdnce level j varlabiiity, tolerable error, 
and sample size 



15, Qalculate a range and ptandard deviation 



1 



^16* Test for a signif leant difference between 
, means ■ 

17* Create a' confidence interval for the mean 



M 

H 
M 
L 

H 

M. 
M 
M 



L' 
L. 

H 
M 

L 
H 



TOTAL QUESTIONS 



QUESTIONS 
IN FRAME y 



4 
4 

6 
4 
2 
'6 

4 
4 
4 



2 
2 

^6 
4 

2 
6 



64 



EKLC 



High importance; M: medium importance; L: Low importance ' ^ 

Half of chese quesiti^ns belong to each -of she two^groups based on as timated time 
•time required * 



; - ■ ■ , . ■ ■■ .-8- ■ ■ ■ 

■ ^-^ . • . ^ ■ _ ' ' " ■ ■ * ' . = , , ' . : ■ ■ 

used . a special ribbon and forms In* printing these exams we also had ah oyer-. 

' night deiay before thay were 'available.' Usuilly*we can; print exams within 

30 minutEs of running BEGET. ; ^ J : ¥ 

BEGET'.makes provision for g^enerating exams by sampling groups that ^ can . 

be dpafined using .either key terms ^of questifan characteristics. Key terms used 

can be either those that -appear in the National File or ftew terms added during 

the editing process. In this case, our question frame had been set up so that 

we didn't need to. use keyterms at all when examS were generated* ^ We only 

needed to specify time -required . in answering questions to form these exams. 

For each exam we asked fd%ione random sample M four questions from the group 

of questions requiring less > than fi^^ ^inutes and a second random sample or 

eight questions from the group s/t^quirlt^^ five mirjutes or more. - . 

,We would like to report that we were delighted with the three exams in 

* ■ V ''^ ■ ' , ■ ' 

Appendl^j but feel obliged to poirtt out that there were some difficulties. 
.Answers were not provided for two questions .Exati 2. - (This is being qor^ 

. ■ / ■ f , . * . \ s 

rected as questions and answers in the liational File unde-rgo external review.) 
tlore importantly. Question 10 in Ekam_ 3 was totally unacceptable. Somehow^ a^ 
question asking for flaws in a gr.aph that hadn't been .printed penetrated our 

i ^ * , ■ ' - / ■ * ' ' 

screening and editing activities and martaged t© come out of hiding' in our firs 
trial of the exam generator. App^ently, BEGET provides no pirotection against 

improper items* We humans are sclll needed. 

■ " ■ ■ ■ . ' = ' ^ / 

' • ' ' ' ■ . ■ ^ i 

• \ DISCUSSION 

• In retrospect, these efforts to automate exams for our How to Lie wi^h 
^Statistics probably were i^re typical than'^we really wanted.. We invested 

around'* ten days in setting up so that new exams can be generated with rela^ 



^tivel^i little ad44*lonal investment of effoct* While part of this start-up 
cost can be ass^sned to shortcomings in the files and programs now making .up 



STICSj most of tfiis time was required for activities that depend on people — _ 
including' .reading the instructions provided in "self-documented" programs' ' " 

■ ./^ r , > : r ' . ■ 

such as S|LECT. ^ ^ 

■ f J ■/ • , - ' ' ' . ' ' * ■ . /■ * ■ . -.^ 
; Setting objectives takes time. So rfoei resfetting objectives* ; Looking 

at items selected to reflect objectives ^takesr time. Discarding unacceptable 

items takes time* And.wri^ting new items takes more time, ji While programs may 

later be develbped to aid. in these activities , these are now and probably al* 

waVs will be predominantly people 'dominated activitieSv_ 

Then where do schemeg like STICS fit in? Even/thd'ugh th^ components of ^ 
■ ' ■ ' . ^ '' - ■ -. 

—STICS used in this exercise are stiir evolving s'^thBy probably are adequate \ 

■■ . . ■ ' ^ ■ ^ ^ ' - ' ^ ^ ' 

for picturing where the payoffs are*" By arid large, they seam tq he much 

attitudlnal as they are time-saving. They are attitudlnal in ' the-^ sense that 

they make -it seem credible that two or more people .can concentrate their 

efforts during a period' of a month or so and reach an^agreement on course^ 

objectives that are expressed in the' exams that students later will take*\ 

All of this dan be done without a scheme like STICS, but it rarely is'^done, 

STICS makes it possible to coi^vert Monday's agreements onj objectives .to Fri^^ 

day-'S discontent at the ^kinds of questions that^- thpse- objectives/lead to* 

^ \ * _■ - ^ -. ■ . ■ 

It makes it possible tOipush this process fast enough so ^at arriving at 

an acceptable question frame doesn't require six exercises in recallipg what'.s 

-^been done and where things 'stand now* ,It makes it realistic to bQllevu that 

a courae can be se^ up so that student exams consistently will , be , in agrefemeriC 

with eourse bbjectiv^s over a period of time, ; - - / 



■.■-10- ^ 

If a course dan be 'brojigUt past the start-up period so that^exams %rm^ 
*belng proflucedV the time savings^ become reasonably clear* In 15 to 20 min^ 
utes a BEGET user now .can set up "five exams like those in Appendix "5 These 
are exams ready for reproduction atid accompanied by answer guidelines. The 
savings in time are 4asy to see. ' ^ / ^ 

-The, problem ts arriving at'a state that will support exam generaEipn* 
Here STICS offers help that will ;vary widely from case t^ case.^ If STICS 



supplies 3/4 )of the questions and aoswers needed for a course, the time 
savings will be appreciable* But, our experience to date Suggests that 
may be high for the current version of the National File^ and that for some 
people it never will be high. Sometimes \ the reaction of people^ extracting 
questions is "1 didn't like most, Q'f the questions , but I con look at them 
and write new ones that I ^ liki, " "f^en ^ICS saves - time , but much' less 
tlnie than when hand crafting is regarded as necessary. . 

While the procedure we .followed may see:fti lengthy and time consuming, 
*^ two important fea&ures should be. noted i 

1) start-up efforts may be sybstantialj but these efforts should 

be repaid over several' ypars to come in terms of clear course h ^ 
. axpectatiojis and corresponding . exams ; : - ^ . ; 

2) . there should be improved consistency of course requirements* ^ ^ 
The purpose of this pap^ is to make explicit a metiiis of using the STICS 
rsystem to clarify objectives for a course and create examinations based upon 
those objectives* We attempted to show one means of u tiili zing, the -sys tern 

.■ y ' ■ ■ ' ■ - ■ . f 

for . an imaginary course using tTuff/s bo.ok How' to Lie wl_th^gt^tiatlcsu \fliile 
our example kep^ to a^ well--def ined small subset of material, thfi proqedure 
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we folXswed wouid -be the same for any course pbjecttves ufidep te 
Deplsidns oa objectlyes weightings^ etc* may indeed .differ, but hope ^. . 
' have: indicated a procedure which can bpriefit moMt people' inuol^ed with intro'- ^> 



ductory atSnJstics ins^truction. 



r * 
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Appaniiix 1 



Chapter 1 \ ^ 

f' 

1. Student should be able to look for and identify hpw sample information 
was obtained for a given study* 

2* Student should be able to identify bias in a sample as non^response , 
an unrepresentative sample 5 interviewer contamination ^ and tendency to 
give socially accGptable responses, 

3*^ The student should be able to define the terms random sample and 
stratified sample and explain the advantages of each* 

Chapter 2 ' . . 

1, Cfiven an average, a stMent should be able to distinguish which among 

the three common averages (mean,^ median, mode) has been used or demons tr ate 
some skepticism on the use of the average* 

/. ■ _ _ 

2* Student should be able ^o identify which common measure of. an average is 
most applicable, or informative, for a given situation 

3* Given a data set, the studetit should be able to determine the mean^ 

medians snd rnode. , , 

Chapter 3 " 

1* Student should be able to identify graphs/charts which have been 

incorrGctly labeled (or not labeled) or where axes have been truncated* 

2/ Given a scotistical report, a student should be abie to state the effect 
that a small sample size may have on the results 

3* Student should be able to explain why stating the level of significance 
is an important part of a report* „ . ^ 

Chapter 4 , 

1* Given a probable error or standard errors the student should eKplain or . , 
demonstrate how they are useful in interpreting a statistical report* 

2p Given a comparison between figures-with a small difference, the student 

'should utilize the concept of s tandard ■ error and recognize that small 

differences may be meaningless* ^ ' 

■ * 

Extra 

1. Student should be able to distinguish between the 'concepts of association 
and causality. 

2* The student should be able to identify whan a one-dimensional relation- 
ship has been misrepresentad by an improper two-dimGnsional graph. , 

s 

3. Student should be able to identify or give variables that are positively 
correlated or negatively correlated. 



AppfendiK "1 J i " > 

- 4, Studeivt s^uiuid be able' to demonstrate that they . can critically analyze 
a statistical report using the foriowing five questions as guidelines* 



1. 
2 . 

3, 

5. 



who says so? 

how does he know? 

what is missing? ^ . 

d id somebody change the sub j ect ? 

does it make sense? f 
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Tree .struct ure Listing ''of Key terms 



PEOBABILITy # . 1 

BASICTERMS/P # 
PHOBHODILS # 
. PRO B D is f R TB a^IQlil^# 

M U LT I V A B I k%B/ f^^^'^ 
UHIFORH m 
JOINTDISTRI BUTIO 
TDISTRrBUTION 
= FDISTRIBOTION 

XSQRDISTRIBUTION 
STANDDNITS/NORM A 



tf 
# 

« 

« 
# 



5 

"6 

rr 

8 
9 
10 
11 



TSCORE ' 






•12 


ZSCORE 




# 


1 3 


OTHER/ST - 






1 4 


POISSOH 


# 


15 




BINOMIAL 




16 




^ ' NORMAL 


# 


T7 




NORMALpAPER 




18 


GRMINGCUR7E 


# 


1 9 


OTHER/N 




# 


20 


Mr. M FN^<(^ FN P*M 


# 


21 




DENSTTYFN 




22 




CUrtULATIVEFN 


# 


23 
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COEPPVAEIATION # 230 
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ONETAIL/T # 100 
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KUrf b*fc.Lfc.U( 



. Are you a BELECT ejKpfijrt:^ ND 

Thlm is the QuestiDn FraiTia EK'^raotiifff^ -pra^ram* It al lowia r wgu 
to specify P.©^rf:t#rmm or chan^acrter imt icm thej*t^;^W reauirK? of 
^gestions to bm u^md on mxamm * - ^foid be^^ln b^Hfiipecif y :i,ri^ a 
Bindl© keh^tferiTi ancl thi^ cQiTiputcf?.r rft)plic?^m by indicatins^' haw'^ ^ 
^Tiarik Questions hl^y•e tha f)^M\.rlVf^ rm . You th<v?n may difcard mom^ 
of .thes(? «u©i5tionsr by admn^ anpihwr* k^yteriifi ar cTiaractc^ristlcr'^ 
that you T©auire. (i*^* yoii now say that you only want «uc3<ii^tiona 
having both the f i rst and mrtrond re^aui rem^nt ) The? coiTiPUtc^ r . ^ * 
then rep^^rtm how many Qu^mtions matimfy ybur rac^ui rcsm^ntis ♦ 
You may continued addin<3 r^Rui foments or you may aocBpt a 
fgroup of «ue?^tionmV""When^^ auestipnsy you.' 

' de-Pine a sesim^lnt* You may de^f ino ^se^'varal me^Uifi^fjtB in a mingles 

run of this pnparsm* dnte. or more se^iTi&ntm oomprimcj a ^ - 

jciuestipn and an§iw©r frai^* ^ thim, program itn the first stc^P in, 
storing a f rame) in ^ ^ - ^ ...... 



a f rame) 
to make up oHams^ 



fof^^H^a 



f:b:t©?# that can bte later ma 



During the rUn of BELECTV yqu ^will receive two kind^j of prqmpt 
, Command^ Uhioh calls for B word rc^sponmes^ YY>GCe ri^sponise " 

word HELF w:l11 remult in a list of the Uiprdis that c^n be^ 
used ss a remponse lahd an OHi^lanatidri -of what they cxo 
^Code No** Which 'calls f o r a ' numerics rejjponse y e # * -I'will ^ ^ 
result in a li^t of the s^luem that can be uised fiy"id an 
^ . explanation of what thew ^o . - 



ini'^ial k.eyterm reoUirement fojn 
allows lipecif ication of addi-- 
and CHaIacTEFJISTIC allowti^^ 
reGuirementfe ba^N?d on/ charac - 




&_ 



As^ a 



NEWSEG is used to provide an 
rseameht membership.*. KEYIiiiRM 
tional ke^term reou i rements 

specif icatiou of additional t \ 

^beristics such as ttuefrbion Type Co^ni tf^ Le 
i^y © rm vB l r i J 1 e y 5j 1 1 e y t e r m s ^ e a ij i a a^ i o n i^i a i j :L d 
before ^characteristics specifications im b^<^un ^ 

When the auestions were «smidned ke^terms the person assisynin 
asked - what keyterms woulW 1 use if I. W^g^ only allowed 'a few- t 
for this c^uestion? T4ie terms chosen, ^^re called primary Ikeytermm 
All searches are basad on prim^at^y key t;erm^rMjnless^ the comm/snd ^ 
^SRCH is used. to chans^e this basisu ' ■ . _ , 

. \ " \ - < / ^ 

When this pro^^ram selects fofe wXiestions havin*^ certain ^ * 
characteristics it wiSll%o 2>o on the basis of «ues;bions 
having a ' charac^^terist ic "e«ual to" ^what ham been specified 
Q,m% Find Questions for which Time F^eouired ip OQUal, to> 
sayy 10 minutes* When this is not the basis that you wAnt 
for dealing with characteristics y you ma^^^ chan^^e to a different 
basis for a single search by enter insrf -2 when Code No* ♦ is 
requested for a particular^ kiffd of characteristic* Then you 
can search for Questipns on a basis such as not 'eoual (NE)^ 
less thsrn <LT)y etc *' I Once the nent search has been completed 
th© search^ basis reverts to equality unless -2 is a^ain used 



^erifl 



Command: CHAfS^ACT 
Code No . * 5 
Time Code No* t -2 
Re 1 a t i D n a 1 V Co m m a n d ♦ 
Time Code No. I 10 



( s specif iem time 
( cman^e in reli 
LE (HELP produces 

(search i«j for w w.ywn ^. 

Required less than or ewual ioiiO Hinut/v^m) 



• Reciuired) 

rtlonO ^ . 

a 1 i st of commsnds ) 

Questions with tfime 



I 



^ftre you 

There fri'tl Bm a short iiou^ a*s the 'FNisrwit File 'is; oi-iantad 

CofifiTiand: NE'WSLtG • . ' • ' , 

■ . •Kiyter*iri Code No. J :l.6:l . ^ . 

"Thisre ..■) ro now.. 2 Qu(v>«>t ;i, ona :i. n SeMi^mejnt I 4:1' 

.CommandJ' DROP' LAST V ^ * ' . „ 

****- th3t Comm'and is not valid, at this tiiriKj- )K>K** 
CjjJTynsndfr' NEW SKS* \SE(3 ■ • " ; 



KfJwfce?riTi Code? • NOjisI ;l.52 



There are now 1 i. Qufijstions in SiffMiTi©nt , # 
YComma'ndS SAUE ' " 

\'% (suei5tioift«i mrm isav^d in Seamerit # "i" 
Cc/iriinandi NEWS£a/ 

I Command i I SAME . . 

1 QU^s bianij? .are sev^d in Be^mc^nt \2 
ComiTiandl >^EWSEG 



' Ther^ are now 1 Qf.ii?s t:Lanm in Se^s^nrrent ^ 3 
Command I SAME > . 



ComrriindP NEySEG/ ' . 

• Keyt^rm Code No,U 



U 



There are now 63 QuestlonfS in S^Mnt U'iu 
Command J SAUK ^ 

63 auemtions; are ?i.av^d^ in Sie^menb _ 
Command^ FINISH - ^ ^ ' 

. ^ ^ ■ - . ' ■ " . " 

Enter filG?^name' far storing . Wour ramults 
(FcDrmt XXXXX where X im a letter) t HUFF 

Da^'WQU want lyqur" Frame^orffer randDmi^ed? NO 

Th© nixt %tmf" in the |froce«iii. for creatinia wour own 
Idcsl file itis to tHpe the? Irommand ^. ■ 
.DO CREATE ^ 

sfter you leiuva thris pros* ram aiTd your local file will be? 
crested. .If yoLi miinply want a printout of your filoy 
you should twpe ■ 
.DO PTt-lkl ' ' . ' ■'■ 

STOP ^ 

ID OF EXEGiUT-lON , 9 " 

rnir>- riMEl 2.11 tlLAPSED TIME? 16J5'0.02 

hKjLT 



AppontlLx 4 ' 
Revised Set o I Objectivus 



^2 * 1. S^tiidont Bliould be able Co look for and identify> how s^ampleT information 
was ob*tained\for a givon study* ' * ^ . - • 

2 N 2, Student shguld be able to identify bias in n sample 'as caused by one of 

tlie following^ non^ response , an unrGp.rescntocivc snmpla, InCervicwer, 
coHtaminationf or tendendy to give^ socially accept'^ble responses* 

3 3* The Ftudent should be nble to doFine the terms rnndom snmpl^ nnd 
' 1 stratified sample and explain the advantages of each. 

t , ■ ^ = - ^ 

2 4* Gi^en an average^ a student should^e able to ^i^tinguasH whicli amorig 

the 'chree cotinion averages (metin, lined ian, mode) lins been used og 
demonstrate some skepticism .on tUre use oC the average. ■ 

1 5* SLudent should be able to idontiCy which common ^iiensure of an average 

is most applicable, or dnf ormative > for a-given situationt 

3 6* Olvon^a data sfet, the student should be able to determihe, the mean, 

median, and mode* ^ ... 

2 7* Student shouM ibe able to identify mipleafl^ing or' poorly constfucted 

graphs/charts, . 

2^ . 3*^- Given & statistical report , a student vsliould be able to state the >ef feet 
' .that a small sample size may have on tlie results • ■■ ' . 

2 9* Student should be al^ie to define the term level of significance and 

explainvwhy it is an important part of a'report* 

. 1 ^10* Student should be able to define or explain the term standard error / 
- of a mean* 

1 11. Given a probable error oc standard error, the student should eKplaiii 

or demonstratie how they are useful in interpreting a. statisticaj report, 

: • ■ ^: 'v ■ ■ . " : ^ ■ - 

i 12. Given a compnrison between results with a small difference, the student 
Bhouhl utilise the conrept "of standard error to I'eeognizc that small ^\ 
differences may be meaningless* \ 

1 13.' The Student should be able to eKplain the use of^a random numbers td^le 
to gdnerate a random sample> , 

3 14. When using n simple rniulom snmpling procedure for estimatinr, tlie moan 
* or propartlon of a population, a student Hhould be able to utlli^*e 

the inter^-relationship between the following four quantities^ level 
of signif^icance, tolerable error, estimate of the variability qf tVit 
population* and sample size* . 



15. The student should be nblc to determine or interpret the/flffige and 
' standard deviation of a data set or distribution. 





J"\ O^ven the moans of two 'iiHlepDmlont samples, ch^ sninplc variances and 
, ' that tho populntlon variances are aqual, the ptuiiGnc 

sUip'uld be able to test af the two means are significantly different* 

17.^jGiv^^^^ a samplG data sfet, the student ahoul^ be dble to calculate the 

*53|dMard ^errpr of the mepn and use this to] create a confidence interval 



mm 



\Q mean. 
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Append Irx 5 



{ ■ . \ ^ _ - 

Quistion # 1 5.0 Points 

The xange is d measuua of 

9 1. relative frequancy 

= i, variabilit y 

3, -^central tondency 

4* i^skewnass ^ 

* 5* none of these. 



Jiow to Lie with Sta'tistics 
Spring 1978 , . 
Exatn - t ' 



Question # 2 5*0. Points 

TRUE OR MHSE? IF ^ALSE,; CORRECT IT. 
^ IN A RANDOM SAMPLE W ITH R EPLACEfyiNI FROM A PDJ 

INDIVIDUAL HAS THE SAME CHANCE OF .BEING INCLhI 



ULATION^ EACH 
'ED IN THE SAHPLE. 



Quest ion # 3 

Ths standard' 
deviation of; 



5.. 0 Points 
err or of tTie 



mtfan i^ another name for the standard 



c, 
d. 



V a B a in p 1 

a population _ ^ v . 

J the sarapiiny distribution of' an y^^ta tistic 

th2 sampling distribution of the mean . 
none of the^' above' / ^ 




Question # ^ * 5.0 Points 

. laUE OB FALSE* :IF FALSE EXPLAIN WHl. 



I f - th.^^ results of an in vestigation show - that on e .sleepintj tablet 
works better than another at the 5% level of siynifica nce^ ths 
. oonclusion would be the same if t ested at t hp ; 1 0^ "levtf^^^^ 

:^ ^ - ■ .■ ^ --'^^^ ■ ' ^ . ■ ' ' " ^"-^ 

Quest iort #5 00 .0 points , 

you /are the par-?nt of two children that have been yiven an IQ 
tes^ that: purports .to be a sampling of " the,^ intellefct.,i One child 
" r ece Ivas a score = pf 99 and the ot hetf^^^c^ives 101. . igu know# . 
of coursb^ that the. IQ is based on 100 .as average or •nofcmal',' 
Does this imply that one is "above averaye*' and thp other is "below 
a veraye"? Explain. 



Question tf^ b 10.0 Pointa^ / ^ * ; ' \ > % 

. A jcoport idsued by d consul tin^%irin re^Aiued t)^^^ 

lar^je chemiGdl^|)iant is 'cpnGep^d ^with' number of fisb an a: river 
, Jsfefore an4 af ter the' plant was buiit. 'It statesi' '^^ r '/ ' 

■ ■ ■ ■ - ■ ■ " ^ ' • M - * ' . 

"Av^raye numher of fish hetore construction 100 , 

/Average number of 'fish one year ^fte? opera td'on began^ 25 . ^ 

Tli^s dif f eranca' is not siyificant at the levels kp thara is no 



yqod reason to regard plant operation as harmful,'' 



Do you ayree v^it h thia c^bnclusion ? Explain, 



"'Question # " 7 10,0 ^ints / ' / % 

The ' rt^sults of a bar pressing exper im en t f or a group of ' y 
rats yielded a aean ayual to ^25 presses/hour with a standard error of 
tho mean e^ual to 5 , I f ' tko ■ pqpuia tion .^aean bar pressing eg u-als 10 

" prBsaes/hb ur what is-the probability of someonfe:^ .being able to 
repeat the experiment with ' results of a sample mean *great^r 
than 25?' • , ^ ' 

yuestion » 8 .10^,0 Points , ■ . 

A newspaper editor vdnts to survey the attidudes of the 
Vpublic toward Littering, Discuss tlfe tela tive (Merits of 
using personal interviews or mailed questionnaire's as , 
methods of data collection, ^ ' 

i- : . ^'^ / -, ^"^ ^ ' ^ ' \ ^?' ' . ■ ^ 

Question ^ 9 10,0 Points ^ ' j 

In the 1 936 Presidential iLlectibn Franklin D, Uoose-velt defeated A 

E, Landon. in a laridsiide vote, A Landori victory had been predict 

the Literar yDigestV a; magazine' w hich ran the oldest* largest^ an 

^widely publicized of the polls at ^he.tlme,^ The Qigest^s final p 

-tion was based on ten million sam plebailots mailed to prospectivl 

ters an^d 2,3 million were returned. The sample_of vote rs' was d m 

lists of -aiitomobile and' telephone owners. Despite the massive si: 

of this tample;^ it failed to predict a Roosevelt victo^y^ being-off the 

mark. by 19 percentage poi ntbs Explain, why the. Digest was sp wroii'gi 

Quistib^n.*^ 10 10,0 Points . \ 

A SAMPLE IS CHOSEN BY NUMBERIMU ALL THE RED DODKS JN TilE LrDRARY 
AND THEN CHOOSING' T'HE^ONES TjrAT C OH R ES PO N D TO HAKDdM DIGITS IN * 

TABLE,^ \, • - ' . ' ■ 

IS'^THE SAMPLE I N DEP EN i^ilT? ' 
- 'IS IT A SIMPLE. RANDOM. SAMPLE? = _ " 

^^--OT WHAT PO^PULATION? 

C (EXPLAIN YOUR ANSWERS REASONABLY) . ^: ' 




Quastidn # 11 ' 10,|0 Pointa. 

, ^br the fDllowiny datd ^ 
= . \ ; B, 6, 9, 10, ^6, 6, ni 

' ? Calculate: . '[....-. 

=Y ■ #) • • inean^ . . ■ ' . 

b)=: ..nied ian ^ . - >t . 
^ = . ' pjH^. fnode ^ ' ' ' ^ \ 
* , ^) range ^ / 
. ' a) var i^nce ; " 



Q bastion # IE ^ 10 ^.0'/ Points . ^ ^ 

tI' THE OBSERVATIONS: 1^, 9, irif 12, 15 ARE A^SAMPL€;Or' 

SIZE 6 FROM %i^\y, 24) , THEN ^A 95'%/ fcoNPlDENtS :1N^ 
FOe .^U HAS CONFIDENCE LIMITS: ' ^ 
^ ^ ■ ' * - ' « \ ^4>^ s V ;^ • - ^ . ■ \ ^""^ ■ ■■ ^ 

k) 12'*!- 3.02 3) ^1/5, +! 3* 92 ; C) 12 I - 5 • 1 4 2 . ^ 

-D) ;1U5, +5' 5.-142 E) NONE OF THESE / : : i ' 




Question » .1 5.0 Point! 



d. ■ 



Ousstio,n I 2 
' TRUE 



5, 0 Roints 



HOW to Lie with Statistics "i^ 
Spring, ,1978 r'- ' ■ f M- 

.Exatn - 1 



Question • 3 



'QUastidn # a 5*0 Points > . ^ 

. ; True^ 95% confidence automaticall than 
. 90* confidence, 

..^ ^'v • - . . . / . . . ' ' ^ 

Question # "5 10,0 Ppints . ^ 

This question cannot be answered accurately without knowing the 
standard deviation for IQ scores. Then these differences from 
the mean or average can be evaluated relative to the standard 
deviation as to whether there is a significant difference between 
them, • ' ^ ^ ^ ' ^ 

Question #6 10,0 Points ^ 

There are raariy reasons for disagree ing, 

1, The Type 1 error rate level is very low {.5^ ^ ^005) which 
fflakes it more difficult to detect a difference, 

2, We do not know how the saiDpling was carried out ^ was. it random? 

3, We do not know what size saraples (or even if they were the same 
. ' size) were used. 



Question * 7 10,0 Points 
n ^ 9 
xbar ^ 25, m ^ 10 
SIGflA(xbar) ^5 

-Z ^ (25-10 )/5 ' 
= 3 



Area beyond ^ 3) ^ ,0013 

O^ob, ( someone obtaining an xbaT > 25 ) ^ ,0013 



Question 



# 8 - -10.0 Points . , 
^ /Answer - One of the main problems with surveys^ as 
in this particular^ case, is the response rate. The mail 
quastionnaire is generally the least successful in terms 
of response rate* Persona 1 intervie ws are generally' 
much more effective, but do present problem s with 
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. . r#apect to potential interviewer bias. Such probleiQs 
as i#ading questions, guestions which often ^ 
prorfuce socially acceptable reeponaes, and even subjects' 
attempts ^to yield expectea^^or favored, responses □tten \ 
• plague the daaign of an interview based , survey, , 

= ■ . ' . ■ . ■ . ^; ... . ■. .\. ^ . .. - - " 

The mail questiohnaire can often be structured to 
avoid niany of these difficulties, or sources of bias. 
However, if attempts are not made to increase the response 
rate, the sample may have a built-in bias. Various 
techniques used to increase response for mail surveys, 
generally revolve around aotoe effort to make the mailing 
appear T>ersonal, For example, using envelopes 
accurately addressed rather than simply " RESIDENT'*, ^ 
providing stamped return envelppes, enclosing an induce- 
ment, such as a pen or pencil, etc. 



Question i 9 10,0 Points 

The problem developed because the Digtfst relied on voluntary response 
and such samples are practically always biased. The respondints rap- 
resented 'fcnly that subset of the population wit h a relatively intense 
interest in the subject at hand and in no sense reprasents a random 
sample or representative sample. 



Question # 10 10.0 Points 
NONE PKQVIDEO ■ 



Question #11 10.0 Poirits ^ 
a ) mean - 8 - ' 

b^) median ^ 8 ^ . ' 

c) mode ^ 6 ■ V . ; ' 

d) range - 5 . 

e) variance ^ 26/(7-- 1) ^ 4.33 

V- 

Question » 12 10.0 Points * 
(NOTE: TABLE IS HEQUIRED) 

This is a^ Cross* reference between the question numbers on this exam 
and the quest ion ^numbers. in the File ^ 

1^ 37 2- 2 3- 29 52 5- 28 

6- 27 7- 53 8- 58 9- 6 10- 3 

11- UO 12* 48 > ^ . 



' • * - " now to Lie with statistics 

: \ 1^ ' ' Spring p 197d - 

^ = Ex affl - 2 . ' - 

• - = ■ . / . / ' i^' ' ,, ^, . .. " ' 

Quastion » 1 S.d Points ' , ' 

' If the population milee*^per week jogyad by people 

oyer 65 equals 10 and the standard error of the roe'an for a gicoup of 
/ 9 is 2 niile^ per weeJtf^what is the pro^ getting a. random 

■■\ i. sample of size 9 with' a ' mean greater than or equal to 25? 

a*^) about 50% 

b*)'- almost certain ' 
. ■ c. ) almost n^var , 

d» ) impossible, since you would never, catch then* 

Quastion # 2 5.0 Points' 

A simple random sample is one .where 

a) you decide on a sample siie and sample proportionately from 

' . the strata* ' ' ■ ■ ■ ■ ^ . ' ' : . .. ^ .. \ 

h) you choose each itpm with no Regard to previous choices, 
' c) each item in the population has an eyual chancs of being qhosen*^ 

d) all of the above are true* 

e) none of the above are true. 

Quest*ion # 3 5*0 Points ' / * ^ 

COMPUTE THE MEANp MEDIAN^ AND MODE OP THE FOLLOWING SCOBES: \ 
2 ,8^4,6^5,5 ^ 

Question # k 5*0 Points ^.^ . ' 

I The median of the 7 numbers £9, 1, 3, 0, 0, -9, 10) is 

i - - . ^ : ■ • 

1 1) equal to the mode ^ . ■ . - ' 

1 "2) equal to the mean 

\ ■ 3) 0 ' 

i 5) none of the above l 



istion # 5 10*0 Points 

For questions A to D, use the following data set: 7 , 3, 6 3# 1 #2 , 6,6, 

7, 2, 6, 3, a; ; • ^ 

a* What is the mean for this data set? 

b, what is the" median for this data set?* 

c, what is the mode for this data set? 

d* what is the range for this data set? » 

" ' ' ■ ' > . • / . " 

Question t .6 10*0 Points * 
Find the mode tor the following data seti 
Pord# Chtvy, Dodge, Ford, Oatsun# Datsun, Ford, Chevy^ Dodge* 
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yus^tioji « 7\ IQ.O Points v * . 

A floor muhager of a large department ato re is studying habits of 
theit customers* Suppose he has good reason to believt that an *; 
tstidiate of $600 fo'r the populmtion mean given to hio is wrong* , 
He' makes preparation to draw a sampl*e but lacks the funds to draw 
H ; ^ TOO as he had planned. Jlbw large a_. sample need he draw in 
• o rder to estimate t he population mean within JIOO of the trus 
value with a probability of 0* 95? (Assume SIGMA - J500) 



Question #8 MO.O Points 

The following are the reaction tinies in seconds of five people to a 
particular stimulusi 7#8^6^10^9, , 

af, Detetmina a '0^90 confidence interval for the mean reaction tima 
for all people to this stioul ua, , . " 

■ \ . ■ 

Question # 9. 10,0 Points 

. TROE OR FALSE? IF'FALSE, W H.Y? 

,\S A FIBST SrEP lW SAMPLING VOTOR PREFERENCE FOB PRESIDENT . 
THROUGHOUT THE U.S. , A SIMPLE RANDOM SAMPLINU PROCBDUHE SHOULD GljfE 
. MORE RELIABLE RESULTS THAN A; STRATIFIED SAMPL 
WITH THE SAME SAMPLE SIZE. • 



Question #10 10.0 Points' 

A manufacturer wishes to det«rinine the averay-i weight of a certain typs 
of product in ord'^E to design the proper packaye, what size *sampla is * 
required so that the ciak of exceeding an error of ,20 pounds or Bore ii 
,010? Assume that sig ma is 1 . 1 0 pounds, . 

Question » 11 10,0 Points ' - 

EACH aTH GRADER IN A SACRAMENTO ELEMENTABY SCHOOL WAS ASKED ; 
THE AGE OF HIS FATHER. THE RESPONSES ARE CLASSIFIED IN "THE 
FOLLOWING HISTOGRAM. " 



PREQUENGY 



10- 



! 
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! ! 
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! 
! 

! 3 ! 

j ^,--1 



! 



I 



2 5*^ 30 35 40. 
1 , AGE^ 

THE MEAN XBAR OP THIS CLASSIFIED DATA IS:" 
A) 35 B) 36. 25 ' C) 37.5 D) 37. 7 



45 



50 



E) NONE OF THESE. 



Qusstibn i 12 10.0 Points. " 
; , TRUE OR FALSE. FALSE EXPLAIN W'HV. 

If the cesults of an investigation show that one deeongestrnt 
"""^ works bettmfr is' no neea to know the level 

off aijgnificanae. \ ; , / 
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flow to" Lie with Statf sfeics 
Spring^ 108 
Exam - 2 



Question i 1 5.0 Points 

' • c.) almost nmwmt 

MU = 10 

. n = 9 

SIGMA (xbar) - 2 

% -■ (25 - 10) / 2 
- 7.5 ' V 



Area beyond (2 ^7.5) = approx. zero 



Question » 2 rS.^ Points [ ^ ." . . ^ 

c) each item in the population has an equal chance o* h|eing chosens 



Question* 3 5,0 Points 

none' PEOVIDED 



Question # 4 5*0 Points 

i (5)3 mode ^ Q . 

mean -2 ' 
median ^1 



Question # 5 10,0 Points 



a. 


4.46 




b. 


5 . * 


1 


c 


6 




d. 


^6 





Question # 6 10,. 0 Points 

Answer ^ Ford; the mode is 
value. 



most frequently occurring 

4~J . 



Question # "7 10,0 Points 

N = [Z (ALPHA/2) * SICHA/a']**2| e ~ tolerable error 
= [ (1.96) « 500/100 ]**2 ' 
= 96 



yuestlon » 8 10,0 Points * 
a, XBAR =; ,40/5 = 8 - ' 

S(X)**2 = [(-1)**2 + (0) **2 * (2)**2 + (1)**2]/4 
- 10/4 , ^ 

= 2.5 ' 

C.I. = XBAR +V- t (ALPHA = .10)*S(XBAB) 
= 8 +/- (2. 13 * .707) 
= 8 +/- 1. 51 
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6.49 < MU 



■Question. 9 10% 0 Points 

NONE PROVIDED . , ^ 

i [ : ■ ^ ' ^" . ■ ■ . ■ 

QuBStion t 10 '10,6 Points 

Using Z = ixBAR - mU) /(sigma/SQBT n) , ' 

we get n ^ ( (Zf CsigDa)/(XBAB - nu) ) **2 
n = H2.S7b) (1. 1)/.2) **2 

. , n> 200.73 

Therefore, a sample size of 201 is reyuired. 



QUtstion #11 ,10.0 Point; 
D 



Question #12 10,0^ Points ■ * ^ ' - 

False^ .any comparison o£ differences between two treatments 
will be significant at some speclfi^eci lev^. The real issue 
' is how signifieant are the results, A statement such as "using , : 
a confidence level of 95%^, ^rand A was found to be signifi- 
cantly better tha n Brand at providing relief" clearly indicates V 
the significance of the result and implies that such a reiult couli 
occur only one time in 20 i£ the tw6 brands were really equivalent* The 
statement "using a confidence level of SOX/ Bgand A was found to be 
siqoif icantly better than Brand B at providing relief" is not nearly as 
persuasive. It indicates much less confidence in the conclusion in thai 
sense that if Dr^nds A and B were eguivalent; differences such as 
that obfeerved would occur 10 tinies in 20^ or 50% of the tine* 

This is a Cross-reference between the question nutnbers on this exaiQ 
and the question; numbers in the File 

1- 54 2- 7 3- 16 4- 17 5- 19 

6- 62 7- 35 'b-. 46 9- 10 10- 34 ^ 

11* IB 12- , 51 ■ * ^ 



■■ ' ' ■ . , : How to Lie with statistics ■ ' 

• Spring , 1970 ' ' 

. .■ ■ Exam - 3 

Quaatiofi /# 1 ^ 5.0 Points ■ / : , V;y 

If a'n InVeatigator wishea to detfceast the size of his sanple 
* which of the fDllow*ing can he do? ' ; 

a*) increaae the tolerabie error ^ = * 

b.) deccease the population variance^ ~ n 
' o.) deerease the tolerable error 

' " ■ . . ^ ^ ^ ■ ^ - . ' ■ ■ .. ' ■ 

Question # / 2 5.0 Points / 

■ TBOE OR FAtSE? IF FALSE, CORRECT IT. 

No matter fr^w convincing it may seem one .cannot aa%ely rely on data 
^ unlesa you know how it was gathered. , 

Queation # 3 5^p Points ' \ 

For a certain normally diatributed population, the^Talue of th6 'standard 
deviation is known, but the value of th4 msan is unknown. . What will 

the effects of changes in sample si^e and in the confide^GS 
coefficipnt on the length of the confidence interval estiBiate of the 
popuiatJon mean? * x " 

. ; a, * Increa^ing^aarople aize increasea the length# given a fixed 
4 coefficient. ' ^ ^ 

b. Increasing the confidence coefficient decreases the length 

given a fixed sample size* g - 

Cm Increasing sample size decreases the length, given a fixed 

coefficient. ^ V * - ^ / 

a. None of the above 

■ - . ■ ■ ^ . ' ■ ' ^ ■ ■ ■ V ■ ■ ■ - 

Question . # y4 /5,0 Points 

Fin^ the mode for :the following da ta set: 
/ Ford, ChW^y, Dpdye, For4r Datsun, Datsun, Ford, Chevy, Dodge. 

Question # 5 10,0 Points 

Consider this class as a sample from a' population taking this exam, 
♦•tfaif of the grades for this class will be below average." , - 

a. What estimator is referred to i^n the above statement? 

b. If we assume that grades will have a symmetric distribution, 
what additional estimators might be used for the same purpose? 

c. why- is the mean often not used to iri^^cate average income? > 
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Quastion » 6 10.0 Points ' _ ■\ \ ' : 

> caller on a rmdiq talk show xeports ha ving surveyed yotecs in 
.his neighborhoo.d and finding: j' - ' 

^ . 66. 2/3* f^vor candidate V 

^ . 33 1/3% favor candidate D, ' . . 

3 3 1/3% more vpters favo^ cand'idate so he is expected to 
. carryLtUe neighboThood4 v ^ 

Dp you aacept tli^s donclu^l^n? Why? ' " v - - — ; . - 

' ' . \ ' - • . • ' . 

QUastion 7- ^^10*0 Points: ' ' 

/ Define th« following term and give an example df its use, 
Your example should not ^be o^ne given in class or in a Handout* 

Standard error of a mean ^ 



Queatdorj » 8 10,0 Points , 

Define the term "stratified sample*! /and explain 
why it would be useful in the following situatddn. A 
^ company is composed of many small plants located through-'*' 
out the United States. A V ids Pr es id en t of the company 
wants to determine the opinions of , the employees on 
the vacation policfy. 



'Question « ' 9 10.0 Points , V 

HEAbLlNE: MPU for Gas (Suzil-ar skyrockets ovse\ 

• , ■, . > .. ' . MPG, for Econ Scooter!: . ;/ ■ 
■ ..■ ;■■ . L.^ft-V-. - ■ .... . . , - • 

Data: in niles per;' gallon • ' 



Gas 

Guzzler 



Econ 
Scooter 





196U 


4 


25 




1968 


5 


30 




1 972 


a 


, 35 




1976 


16 


ao 






100 ! 






Percent 


1 





l^n crease 1 
Over 75! 
Prior ! 
Time ! 
Period 50! 

- ! 
! 

25! 
! 

' ; - ! 



1968 



1972 



1976 



. {To complete the graph coiinect the three G points with 
straiyht lines to relate the performdnce of Gas Suzzleri 
Siinilarly/ connect the three E points to show the trend 
for Econ Scooter*) 

E¥.en thaugh the above graph is correct^ explain 
how it has led to the misleading headline, ■ ' ' , ' ■ 



Question * 10 10.0 Points 

IDENTrFY^TIiE ERROBS IN THE GRAPJl, 



QuiBtion ^ ^^ 10*0 points ' \ 

In statistics we speak often of a r^naom sample. What is a 
randoin sample^ and why, is randomi za tlon so importaht in . 
statistics ? 
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Questibn i 12 MO^O Points ^ " / i . . 

A teacher is cdnductiny .two 'sections, of the same course, As the 
. teacherl ex4)eriencfs the two; sections^, it is his/het b'©rief that 
.the students in aectlon, one ace not i^iny as Will ks th^ studervts 
t in section Cwo* As a preliminary ^A^t of whether %/she ahould 

approach the two sections fiifffrently the teacher administeta a ;> 
' 4uiz with 20 qaestions mi it, t^ bbth^ sections on the- same -day, 
: Herf piC^ the scores for 'both sections, ; ^ 

Sectron^r: 20, 19^ 18, 10,* 17, 16, 16,. 16; 15^ 13 ^- 
Section 2t 19, 19^ 19^ 17, 16, 16, 15, 15, 14, 14 

Should^ the .teacher consider ^theVSQ^^^ dif ferent or about 

. the same? ' why? ' 'f\'.v :^-.rv ' . : \ 
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How to Li^;4w?itt^ 'Statistics 
Spring, v:i-978=^ f;--,^^' , 



QiiegtioQ -» • % 5.0 PoiiitSr ; 

a«)V increa&^lffcbe" tolerable error 
>^ . Llncreasiftg It^^^ffioiirit of errot which will be , 
/Its t^J^terd^ed aM^ws-^or less preciaion in the results, thera 
' . ^raguiijj^g a smallef sample, sizt* Decreasing the tolerable 

wg{a^4/4i^|fe the opposite effect; and obviously ^he ipvestiga 
has fi6 'con trol over the variance in the ' popula tion * 




Question # 2 
T r lie . 



5*0 Points 



ftuastion 3 5*0 Points 

c# Increasing sample size decreases the lehgth, 'given/ a fliid 
coefficient . ^k-..' ^ * V ^ i 



Question # 



Answer t 



V a 1 u e 



5^*0 Points ' 
For d ; thp mode 



is the most frequently occurring 



Question 10.0 Points - 

a* The mean if you use* the term "average" loosely; more p^roperly^ 
it is the median^ since by definition it separates afgroup of ^ 
- scores into two ecjual^sized groups of values* 

b. The mode and mean. * ' 

, '■ ■ ^ A. . - ' ■ 

c* ' The raea^n is* not; often used to Indicate the a'vetage income' btcattse. 
the distribution of incomes like many other variables* tend 
to be skewed to the right* This results from the fact 'that most 
' people ha ve low or rtiddle incomes . whi le a small percentage have 
very high incomes. Therefore* more people will be closer to 
the minimum than the maximum* The mean is highly influenced by 
' . this small| number of ex tremely largfe values* hence* the pedian^ 
income Is coralfionly' reported* » :,v ' " / 



Question # 6 



No* 
was 



10*0 Points ^ ; , 

I don't acpept it because I have' tio idea what size sampre 



used * 
3/ 



and 



ighi;. ^e. suspicipus enough to 



think 



tha t- it 



.Quest ion* # 7 

OeCinit ion: 



10*0 Points 
A m ea su^re of 
s ample 



va riation' among val ues for siAm pie .means for 
particular sample size* The usual method , used' to caldfilate . 
a standard error of a mean f or sample , size n i^s^, to obtain 
a variance for Individual sample elements (5*^2) * convert t 
it to A Variance of a, mean for the desired sample'siza 
(Divide S**2 by n) *and take the^ squats root of th at . vaf i ance. 
Notice that the same value tot S*^2 can Ids used to calculate 



. ^errors of a' rn^n' f^b seyerai; dLffece^^^^^ 

iSicainple: -SuppDse that a randttm iample uf size 1^ has ptovided a - 

. / variranco ©f 25 with 9 .if. . Tlhat variance of 25 provides 

: V the basis f or calcula ting ^ , / . . 

s Ai Standard error of ■ a maan for a ^ampie si^e^bf 

. : ^ , _ ^ ^. ^ ^ ' ^ ^ :.^^^':y'^''^. 

' - B% standard^ errot o£ a ^nean^ f or a sampi^ ' sii^ Ho^^ 

V . ' 16 1.2-5 7 ■ \. j:::'^^;;V:-^' ^ J ^ 

C: standard erro^ of a* mean for a saaple size of 

. ' ' ' 25 = 1,00 - ^ ..^^^ ^ ■ ; ^- ^ . ' 

. ^ >■ ■ ■\ . * : ^'r^ ■ ^ - . ■ ^. ■ _ • . 

Symbola: . S (YBAR) ^ ^ CXDAH) ^ 3IG HA ( If BAfif , SIGMA (XBAR) , . , • 

■ . -^v- v^;-^ . * =■ . : ^ • , ^ 

Question # -0*^^ -^^^1^ P^^ ' ^ ^ ^ ^ . 

. - / Answer - A stratified ^Sample whi^cht has been U 

' >6.btainsd by a. p^Oucedu^ in /yhich the f rame j^s divided intb \ 
non ovstlapplng' categories (stMt a)', / . sAmpiing units are- ^ 
then •selected at pandort from each stratum thus assAiring^ . ■ 
that all st ra.ta ^a re representedV ^n 'the sample, ^ ^ 

/ For the given J^roblem^ : I wo^l^ suggest, a type of . ■ 

stratified sampling pfaceduxe, : S pitif ically, I would . ■ - 

recommend that each plant be, considered a stratum and a 
. ra^dom^ sample olataine^d within each stratum to insure ^tha t' ^■ 
all plarfts arerfepresehted in the sample* . /I would . 
. further suggest that the 'sample from each strat Urn rf present * 
the proportional size of that stratumV For exampleV 
if plant A employs 2511 of the total company 's emplDyees ^ 
.-..then the saimple from plan^^A should represent 25» of , the . 
::iv'^^;-\otal sample ab - f \ - ' 

J ^ . ^ ^ ^ - ^ ' ^^^ y : 'y ' ■ \ V 

giiVstion # 9 ,^^. lO^O . pg^ijitsi ^ ^ ^ * 

Anivrfe)r' * ahfe Keifedi^^ is misleading ^n the sense ^ ; ^ . 
tha*t it implies mpg'is being' compared for ^he two vehicles, 
only uppn inspection of^ th^ data can one see that Ecpn ^ rr ^ 
"snQoters hav#%a Substantially higher mpg^ while tlieir rate 
r -^of increasingi.m.pg has not been great* 'rihe graph . 
-V accurately indica tes ^ the ra t>i q^" increase in mpg^ but the 

^headllne is comparing actual^^g^ which is quite dif f erent^gN; 

Question # 10 10,0 Points ■ . / 

' ,T,r:Ii|-N,|;.: ours IDE AXIS ■ ' ' / ■ „ , ^. ■ 

^2Vi:'S;i:a'PPYy:I.A:D5L {ESPfiCiALW,.yERtICfk a^w ' . . 

3.M0. ElTiiE . r, ' : .^l' ^ " ■ * . : 

/■..a. 'POOR LEamp ^ ; ^ ^ ' , ., ' / ; \' 

Question * 11 10*0 points , _ 

.A .random sa mple occurs when every observation in the populatioii _ 
has a Icnown (usually egual) change of becoming part of the 
... sample*^ na'ndpmization Ife import because the prDBabilities of - 
- sam^lis-^seldcted in a ra^^^d^^^ known^ %hus enabling Oiie to . 

."'V m^ki :v^lid in«^^ from which the sample. 

:\=w.as\'drawnv ;/*'"v\'V'." ^- \ ' \ ■ ■ ■ ' - ^ 



r ' .X4 ^indidates Section ' : * 
: "^ sum (3t (i) ) ^168 

r sun (X(i) ) /n » XBAR '= 16,8 /^ ' 

, ■ : . . i ■ . , ' 

S (X) **2 - 4i1773 :« ' , 



Y 1 n (1 i c a t es S e et i o & 2i 
^uin (Y (i) ) = 16^ . - 

sum (Y (i) )/n - 16, I*" 

'■^rB (Y)«*2 ;a ' a.OUtta ' \ 
S (Y) ^ 2. 0111 • 



Assu,M-hg homogeneity of vaEtance and since aamplas are,.tl^e same 
size {there is no need to pool, Ws get: , ■ „ 



S(XBA1 - .YBAn) **2 = S fX) **2/n + S{Y)**2'/n ' . f ' 

; . .' - u. rr/B/io ♦ a.ot»U4/iO . . . w-^ ' 

. ' ■ • ' . , . ■ * . 822 . ;;■ . . , * ■ J 

□ siny a: two tail t-test yitK ^alpiva: = ^-»05 J^ft^ = ia, we test: 



H tO) ■ mu (X) ^ an (Y) =0 
H tl.)? rau(X) ^ aiU(Y). 0 



f = (16,8 - 16.4) - q/SQFT(.a22) = . '4/. ')07 = ,44fl 



,; ■ I 



- tCcritical) = 2,-101 ' . . ' 

Therefore continue, H (0)(, or .consider t he sections t0 te vabout . 
^"^^fc he ; sa'me. ■ / . ,. ' 

Thi'S is a Cross*referenc6 between the guestion numbers .on this exam' 
■ and the ■questi'on numbers in the File , 



1- 
6- 
11- 



57 
24 

i 1 



2- 
7^ 
12- 



1. 
30 
42; 



?3- 

■8- 



43 

60 



■4- 
9- 



63 
64 



5- 



13 
21 



\ 
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